Dynamic scaling, data-collapse and self-similarity in Barabasi-Albert networks 
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In this article, we show that if each node of the Barabasi-Albert (BA) network is characterized by 
the generalized degree q, i.e. the product of their degree k and the square root of their respective birth 
time, then the distribution function F{q,t) exhibits dynamic scaling F(q,t oo) ~ t^^^^4'{q/t^^'^) 
where 4>[^) is the scaling function. We verified it by showing that a series of distinct F{q,t) vs q 
curves for different network sizes A'^ collapse onto a single universal curve if we plot t^/'^F{q,t) vs 
q/t^^"^ instead. Finally, we show that the BA network falls into two universality classes depending 
on whether new nodes arrive with single edge (m = 1) or with multiple edges (m > 1). 

PACS numbers: 61.43.Hv, 64.60. Ht, 68.03.Fg, 89.75.Da 



Many complex systems can be described as an inter- 
woven web of a large network if the constituents are re- 
garded as nodes or vertices and the interactions between 
constituents as links or edges. For example, cells of liv- 
ing systems are networks of molecules linked by chemi- 
cal interaction [H-Q , the Internet is a network of routers 
and computers linked by cables or wireless connections 
the power-grid is a network of substations linked by 
transmission lines and social networks are networks 
of individuals linked by friendships, professional ties, etc 
P-[Toj. The first theoretical attempt to guide our under- 
standing about complex network topology began with 
the seminal work of Paul Erdos and Alfred Renyi in 1959 
The main result of the Erdos- Renyi (ER) model is 
that the degree distribution P{k), defined as the prob- 
ability that a randomly chosen node is connected to k 
other nodes by one edge, is Poissonian revealing that it 
is almost impossible to find nodes that have significantly 
higher or fewer links than the average degree. However, 
real networks are neither completely regular where all 
the nodes have the same degree k nor completely ran- 
dom where P{k) is Poissonian instead they are scale-free 
in character where P{k) obeys power-law. 

Just over a decade ago Barabasi and Albert revolu- 
tionized the notion of the network theory by recognizing 
the fact that natural and man-made networks are not 
static rather they grow by continuous addition of new 
nodes. They further argued that the new nodes estab- 
lish links to the well-connected existing ones ■preferen- 
tially rather than randomly known as the preferential at- 
tachment (PA) rule. It essentially embodies the intuitive 
idea of the ri ch g et richer principle of the Matthew effect 
in sociology [12]. BA then presented a simple theoreti- 
cal model incorporating both the ingredients and showed 
that the resulting network can reproduce the power-law 
degree distribution which most real life networks exhibit 
[11 [3 • Recently, we have shown that random sequential 
partition of a square into contiguous and non-overlapping 
blocks can be described as a network with power-law de- 
gree distribution if blocks are regarded as nodes and com- 
mon border between blocks as links jl5| . 

A power-law distribution function is considered as 



scale-free since it looks the same regardless of the scale 
we look at it. In general, a function is called scale- free if 
it satisfies 

!{\x)^g{\)f{x). (1) 

It can be rigorously proved that such function can only 
have none but power-law solution [T6| . The kinetic view 
of network topology and the new terminology, scale-free 
network, has attracted physicists, mathematicians and 
computer scientists which resulted in a surge of research 
activities On the other hand, a function f{x,t) 

is said to obey dynamic scaling if one of the variable t 
strictly denotes time and if it has the form 

f{x,t)^tU{x/n, (2) 

where exponents 9 and z are fixed by the dimensional re- 
lations [t^] = [/] and [t^] = [x] respectively, while </>(^) is 
known as the scaling function [131 . There exists yet an- 
other scaling hypothesis, known as the finite-size scaling 
(FSS), that has been extensively used as a very power- 
ful tool for estimating finite size effects in the critical 
phenomena. Within the FSS formalism, a function with 
exactly the same form as in Eq. ^ is said to obey finite- 
size scaling if x, though typically denoted by e, measures 
the distance from the critical point of the phenomenon 
under investigation and t, though typically denoted by 
L, describes the linear size of the system jisl. [l9j. 

By definition, the BA model describes a time develop- 
ing phenomenon and hence, besides its scale-free prop- 
erty, one could also look for its dynamic scaling property. 
This aspect of the BA model, however, has never been 
examined. To this end, we argue that each node in the 
dynamic network can be better characterized by gener- 
alized degree g, the product of the square root of the 
birth time of each node and their corresponding degree 
k, instead of the degree k alone since the time of birth 
matters in the BA network. We find that the generalized 
degree distribution F{q, t) has some non-trivial features 
and exhibits dynamic scaling F{q,t) ^ t^^/^(j){q/t^^'^) in 
the long-time limit. We have verified it using the idea of 
data-collapse. For instance, we show that a series of dis- 
tinct curves F{q, t) versus q for different network sizes N 
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FIG. 1: Plots of the cumulative degree distributions CD(k) = 
P{k' > k) is shown for the BA model with m = 1. The data 
points are averaged over 500 independent realizations. 



can be made to collapse onto a single universal curve if we 
plot t^/'^F against q/t^/"^. In addition, we find that it pro- 
vides a means to classify the BA networks into different 
universality classes which are otherwise regarded as the 
same. Establishing self-similarity and data-collapse or 
finding the scale-free property in any system has always 
been regarded as a significant p rog ress towards gaining 
deeper insight into the problem [20, . 

The BA model begins with a small number of nodes 
mo as seeds which are already linked. Then, at each time 
step a new node with m < mg edges is added to the exist- 
ing network. Edges of each new node are attached with 
m different existing nodes by picking them preferentially 
with respect to their degree k. That is, the probability 
that a new node will be connected to an already exist- 
ing node i is proportional to its degree fcj and hence the 
degree ki of the node i changes following the dynamical 
equation 



dkj 
dt 



(3) 



Note that every time a node is added to the system it 
adds m edges contributing to the increase of 2m degrees 
and hence ^ kj — 2m (t — 1). Solving equation Q 

in the long time limit and using the fact that node i is 
born at time ti with degree m, i.e. ki(ti) = m, gives 



hit) = nT-y-j-) I 



(4) 



where f3 = 1/2. It implies that the degree of a node i 
depends not only on the progressing time t, but also on 
the birth time U. 

We can find the degree distribution P{k, t) by appre- 
ciating the fact that it is related to the homogeneous 
probability function P{ti) = j (nodes are added at equal 
time intervals) by 



Here, the minus sign is introduced to take into account 
the fact that the smaller the ti the larger the degree ki 
in the statistical sense. From equation (jG]) we can easily 
find 

dti 



dki 



1 



and then substituting it as well as P{ti 
immediately gives 

P{k) - 2m^k-''' with 7 



'tK ' , (6) 
= \ into Eq. © 



(7) 



since (3 = 1/2. We thus find that P{k) is independent of 
time t albeit k depends on t and it satisfies the scale-free 
form given by Eq. ([T]). It means that the network in the 
long time limit self-organizes into a scale-free state where 
it no longer depends on time. 

It is interesting to note that the degree distribution 
P{k) typically has a long tail with relatively scarce data 
points which turns into a fat-tail when we plot P(fc) in 
log-log scale. This complicates the process of identifying 
the range over which the power-law holds and hence esti- 
mating the exponent 7. One way of reducing the noise at 
the tail-end is to plot cumulative distribution P{k' > k) 
which is related to degree distribution P(fc) via 



P{k) = - 



dP{k' > k) 
dk ■ 



(8) 



Figure 1 shows that ln(P(fc' > fc)) decays linearly against 
In(fc) with slope equals to 7 — 1 = 2 as expected ac- 
cording to equation ([7]) except at the tail-end due to 
finite-size effect. Furthermore, it shows that the extent 
of linearity increases as network size N increases reveal- 
ing that the sudden fall off near the tail-end is indeed 
due to finite-size effect. Waclaw and Sokolov in [ll] sug- 
gested that the finite size effect in the degree distribu- 
tion of the BA network can be well approximated by 
Pnik) = P{k)w{k/\/N) where the cut-off function w{x) 
is found highly sensitive to both m and mo. 

We thus find that the degree distribution P{k,t) self- 
organizes into a time invariant state in the long-time 
and large-size limit instead of a state where it assumes 
a dynamic scaling form. We therefore have to look for 
another quantity which exhibits dynamic scaling in the 
BA model. Note that a complete characterization of the 
nodes in the BA network can be made by specifying their 
degree k as well as their respective birth time. We, how- 
ever, find it highly instructive to combine the two into 
a single variable using Eq. (|4]). That is, the node i at 
time t can be characterized by the generalized degree 
qi{t) = kiti where the exponent /3 value is fixed by the 
dimensional relation [q^^^] = [t]. The advantage of using 
q is that it depends only on time t since according to Eq. 
dl]) we have 

t^ 



q^{t) 



with /3 = l/2. 



(9) 



P{k)dk = ~P{ti)dt, 



(5) 



It is now customary to consider the generalized degree 
distribution F{q,t) instead of the traditional degree dis- 
tribution P{k). That is, if we pick a node at random 
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FIG. 2: Generalized degree distributions F{q, t) for m = 1 are 
shown against q for three different network size A''. In each 
case data in the graph represent averaged over 500 indepen- 
dent reahzations. 
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FIG. 3: The same data of Fig. 2 for m = 1 is shown in the 
self-similar coordinates t^^'^F{q, t) and q/t^^'^ and we find that 
all the three curves of Fig 2 collapsed onto a single universal 
curve. 



at time t then F{q,t) is the probability that its gener- 
alized degree is q. In Fig. 2 we have drawn F{q,t) vs 
q for three different network sizes N and found some re- 
markable features. For instance, we find that the value of 
F{q,t) initially increases quite sharply and then register 
a sudden and sharp fall to a non-zero value at qc from 
which it rises again to a secondary maximum followed by 
a smooth decrease with a long tail. The value qc, where 
the first minimum occurs, increases with the network size 
N and at the same time both primary and secondary 
heights systematically decreases as N increases. 

We shall now invoke the idea of the dimensional anal- 
ysis to show that F{q, t) in the long-time large-size limit 
self-organizes into a state where it exhibits dynamic scal- 
ing [2l|. Clearly, there are two governing parameters q 
and t and a governed parameter F{q, t) in the problem at 
hand. However, according to Eq. (|9]) the governing pa- 
rameter q can be expressed in terms of time t alone and 
hence we can define a dimensionless governing parameter 



« tl/2- 



(10) 



It means we can express F(q, t) too in terms of time t 
alone since time t is chosen to be an independent pa- 
rameter. Applying the power-monomial law of the di- 
mensional function of a physical quantity we can write 
a dimensional relation F{q, t) ^ t"' where the exponent 
a assumes a value that makes bear the dimension of 
F{q,t) [2l|. We can therefore define yet another dimen- 
sionless governed parameter </> as follows 



F{q,t) 



(11) 



Now, within a given class one can pass from one unit of 
measurements to another by changing t, for instance, by 
an arbitrary factor leaving the other factor q unchanged. 
Upon such transition within the given class the numer- 
ical value of the quantity on the right hand side of Eq. 




FIG. 4: The value of F{q,t) for m = 1,2 and 3 are plotted 
in the self-similar coordinates. It clearly shows that universal 
curve for m = 1 do not collapse with those for m > 1 and 
hence belong to two different classes. 



must remain unchanged since the left hand side is a 
dimensionless quantity. 

We thus find that the quantity -^-^^ vis-a-vis 4> can 
at best be a function of another dimensionless quantity 
^ given by Eq. (|10l) since this is the only dimensionless 
governing parameter. We can express Eq. as 



F{q,t)^ec^{q/t^/\ 



(12) 



where the exponent a is obtained by applying the nor- 
malization condition F{q,t)dq = 1 to give a = —1/2. 
We thus finally find that the generalized degree distri- 
bution assumes exactly the same dynamical scaling form 
as Eq. An interesting aspect of the structure of the 
dynamic scaling form given by Eq. (jl2p is that the distri- 
bution function F{q, t) at various moments of time can be 
obtained from one another by a similarity transformation 



X^/^q, t^Xt, F — >X-^/^F, 



(13) 



revealing the self-similar nature of the function F{q,t). 
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FIG. 5: Plots of ln{t^^^F{q,t) vs ln{q/t^^^) for m = 1 and 
m > 1. It shows that the scahng functions in both cases 
grow obeying power-law until q/t^''^ < 1 but with exponent 
2 and 2.9 for m — 1 and m > 1 respectively. 



FIG. 6: Plots of \n{t^/^F(q,t) as a function of ^ = g/f^/^ for 
m — 1 and m > 1 which clearly show that the scaling function 
(t>{^) beyond q/t^''^ = 2 and q/t^^'^ = 1.5 respectively decay 
exponentially but with two different decay constant. 



The question is: How do we verify Eq. ([T^ using 
the data extracted from numerical simulation? The best 
way of verifying it is to invoke the idea of data-collapse. 
Note that according to Eqs. ([T0| and (fTTj) the quantities 
are both dimensionless and hence 
the data of F{q, t) for various network size iV should col- 
lapse on a single universal curve if we plot t^^'^F{q, t) 
against q/t^^'^ which are known as the self-similar coor- 
dinates. In Fig. 3 we have drawn the scaled generalized 
degree distribution t^/'^F{q, t) as a function of scaled gen- 
eralized degree qlt^l"^ for three different network sizes N 
considering m = 1 in each case and find that all the 
distinct curves of Fig. 2 merge superbly onto a single 
curve as expected. It clearly well confirms the validity 
of the dynamic scaling. The first discontinuity in Fig. 
3 occurs at q/t^^'^ = 1 and the second discontinuity at 
qjt^l'^ = 2 and both are quite sharp. A careful obser- 
vation also reveals that there exists a third discontinuity 
at qlt^l"^ — 3 which seems quite weak and hard to no- 
tice. Now the question is: What if nodes arrive in the 
BA network with more than one edges (m > 1)? Below, 
we attempt to give an answer to this question. 

According to our dimensional analysis it is expected 
that the data points for the BA network that grow by se- 
quential addition of a node with multiple edges (m > 1) 
should also lie on the same curve as the ones for single 
edge (m = 1) unless they are fundamentally different in 
some subtle way which the ordinary degree distribution 
P(fc) fail to differentiate. Surprisingly, we find that the 
data points of t^/^F{q,t) for m — 2,3,4 etc. do collapse 
on a single curve if plotted against q/t^/'^ but do not co- 
incide with the one for m = 1 (see Fig. 4). Although 
the two distinct universal curves, one for m — 1 and the 
other for m > 1, behave qualitatively almost in the simi- 
lar fashion they are however quantitatively different. For 
instance, one noteworthy difference is that although the 
first discontinuity occurs at the same point as for m = 1, 
the second one, however, occurs at q/t^/'^ = 1.5 for m > 1 



while at q/t^^"^ = 2 for m = 1. These differences can be 
even better appreciated if we plot t^^^F{q, t) vs q/t^^'^ ei- 
ther in the log-log scale or in the log-linear scale as shown 
in figures 5 and 6 respectively. Using these figures, we 
can write the solutions for the universal scaling function 
both for m = 1 

m - { Itl^ (14) 

and for to > 1 

f P-^ if f < 1 

m - { iti,, (15) 

where a « 1.4 and b w 2.5. 

Beside proving the existence of self-similarity by data- 
collapse, we have also found that there are two distinct 
classes of networks resulting from the BA model, (i) 
The network that grows by sequential addition of one 
node with single edge (m = 1) and (ii) the network that 
grows by sequential addition of one node with multiple 
edges (m > 1). There must be some reasons behind 
such behaviour. One apparent difference between the 
two classes of networks is that the clustering coefficient 
Cn = if to = 1 and Cat ^ rather decays like Cjv ^ 
^-0.75 if > 1 for network of size N. Regardless of 
the value m, the universal scaling function ^(^) always 
suffers two discontinuities which divide the nodes in the 
network always into three different classes. For instance, 
nodes which have generalized degree q < t^^^ fall into one 
class, those which have generalized degree within t^^^ < 
q < 2*1/2 for TO = 1 and t^/^ < q < IM^f^ for to > 1 
into another class and finally those which lie beyond these 
limits. 

To summarize, we have proposed a systematic process- 
ing procedure to obtain the dynamic scaling ansatz and 
the self-similar coordinates for the BA networks. To this 
end we have used dimensional analysis and shown that it 
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provides deeper insight into the problem. We have found 
that if the nodes of the BA network are characterized 
by the gcnerahzed degree qi{t) of the node i at time t, 
defined as the product of its degree ki{t) at time t and 
the square root of its birth time ti, then its distribution 
F{q, t) in the long-time limit exhibits dynamic scaling 
F{Qjt) ~ t~^/'^<j){q/t^^'^). We have verified it numerically 
by showing that all the data points for various network 
size N collapse onto a single universal curve if we plot 
t^^^F{q,t) as a function of q/t^^^. Our findings suggest 
that there are two distinct classes of universal scaling 
functions, depending on whether each new node arrive 
with single edge or with multiple edges. The scaling func- 
tions (/>(^) for TO = 1 and to > 1 have some remarkable 
features in the sense that they both suffer discontinuity 
once at ^ = 1 and then the curve for to = 1 at ^ = 2 



and the corresponding curve for to > 1 at ^ = 1.5 albeit 
qualitatively they are very much similar. The two types 
of BA networks are indeed fundamentally different. For 
instance, the clustering coefficient Cn = ii m = 1 and 
obeys the same power-law Cjv ~ N~^-'^^ if to > 1. The 
idea of data-collapse developed for the BA model could 
also be applied on real life data as well as on the existing 
dynamic networks. It would certainly be an attractive 
proposition to check if the generalized degree distribu- 
tion derived from the real life data or from other kinetic 
network too exhibit dynamic scaling with similar univer- 
sal features or not. We intend to continue our work in 
this direction in our future endeavour. 
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